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Abstract 

Motivated by problems of uncertainty propagation and robust estimation we 
are interested in computing a polynomial sublevel set of fixed degree and minimum 
volume that contains a given semialgebraic set K. At this level of generality this 
problem is not tractable, even though it becomes convex e.g. when restricted to 
nonnegative homogeneous polynomials. Our contribution is to describe and justify 
a tractable L^-norm or trace heuristic for this problem, relying upon hierarchies of 
linear matrix inequality (LMI) relaxations when K is semialgebraic, and simplifying 
to linear programming (LP) when K is a collection of samples, a discrete union of 
points. 

1 Introduction 

In this paper, we consider the problem of computing reliable approximations of a given 
set K C M". The set K is assumed to have a complicated shape (e.g. nonconvex, non- 
connected), expressed in terms of semialgebraic conditions, and we seek for approxima- 
tions which should i) be easy computable and ii) have a simple description. 

The problem of deriving reliable approximations of overly complicated sets by means of 
simpler geometrical shapes has a long history, and it arises in many research fields related 
to optimization, system identification and control. 

In particular, outer bounding sets^ i.e. approximations that are guaranteed to contain 
the set K, are widespread in the technical literature, and they find several applications 
in robust control and filtering. For instance, set-theoretic state estimators for uncer- 
tain discrete-time nonlinear dynamic systems have been proposed in [U [71 O [23] . These 
strategies adopt a set-membership approach [HI [2^ , and construct compact sets that are 
guaranteed to bound the systems states which are consistent with the measured output 
and the norm-bounded uncertainty. Outer approximation also arise in the context of 
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robust fault detection problems (e.g., see [T4]) and of reachability analysis of nonlinear 
and/or hybrid systems |13i I16j . 

Similarly, inner approximations are employed in nonlinear programming [21], in the so- 
lution of design centering problems [24j and for fixed-order controller design [12]. 

Recently, the authors of [5j have proposed an approach based on randomization, which 
constructs convex approximations of generic nonconvex sets which are neither inner nor 
outer, but they enjoy some specific probabilistic properties. In this context, an approxima- 
tion is considered to be reliable if it contains ''most" of the points in K with prescribed 
high probability. The key tool in this framework is the generation of random samples 
inside K, and the construction of a convex set containing these samples. 

In all the approaches listed above, several geometric figures have been adopted as ap- 
proximating sets. The application of ellipsoidal sets to the state estimation problem has 
been introduced in the pioneering work and used by many diflPerent authors from then 
on; see, for example, [71 19]. The use of polyhedrons was proposed in to obtain an 
increased estimation accuracy, while zonotopes have been also recently studied in [TJ [10] . 

More recent works, like for instance [H [121 HH], employ sets defined by semialgebraic 
conditions. In particular, in [19] the authors use polynomial sum-of-squares (SOS) pro- 
gramming, a particular class of SDP, to address the problem of fitting given data with a 
convex polynomial, seen as a natural extension of quadratic polynomials and ellipsoids. 
Convexity of the polynomial is ensured by enforcing that its Hessian is matrix SOS, and 
volume minimisation is indirectly enforced by increasing the curvature of the polynomial. 
In [4] the authors propose moment relaxations for the separation and covering problems 
with semialgebraic sets, thereby also extending the classical ellipsoidal sets used in data 
fitting problems. 

Our contribution is to extend further these works to cope with volume minimization of 
arbitrary (e.g. non-convex, non-connected, higher degree) semialgebraic sets containing 
a given semialgebraic set (e.g. a union of points). Since there is no analytic formula for 
the volume of a semialgebraic set, in terms of the coefficients of the polynomials defining 
the set, we have no hope to solve this optimization problem globally. Instead, we describe 
and justify analytically and geometrically a computationally tractable heuristic based on 
L^-norm or trace minimization. 

2 Problem statement 

Given a compact basic semialgebraic set 

K := {x G M'^ : gi{x) > 0, z = 1, 2, . . . , m} 

where gi{x) are real multivariate polynomials, we want to compute a polynomial sublevel 
set 

V{q) := {x G : q{x) < 1} D K 

of minimum volume that contains K. Set V(g) is modeled by a polynomial q belonging 
to P(i, the vector space of multivariate real polynomials of degree less than or equal to d. 
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In other words, we want to solve the foUowing optimization problem: 



infgep, volV(g) 
s.t. K C \{q). 



(1) 



In the above problem 



volV(g) := / dx = lY{q){x)d: 

Jv{q) Jr^ 



is the volume or Lebesgue measure of set V(g) and /x(^) is the indicator function, equal 
to one when x G X and zero otherwise. 

Note that, typically, the set K has a complicated description in terms of polynomials gi 
(e.g. coming from physical measurements and/or estimations) and set V(g) has a simple 
description (in the sense that the degree of q is small, say less than 10). Minimization of 
the volume of V(g) means that we want V(g) to capture most of the geometric features 



If K is convex and q is quadratic, then the infimum of problem ([T]) is attained, and there is 
a unique (convex) ellipsoid V(g) of minimum volume that contains K, called Lowner-John 
ellipsoid. It can be computed by convex optimization, see e.g. [21 §4.9]. 

In general, without convexity assumptions on K, a solution to problem ([T]) is not unique. 
There is also no guarantee that the computed set V(g) is convex. Optimization problem 
([T]) is nonlinear and semi-infinite, in the sense that we optimize over the finite-dimensional 
vector space P^i but subject to an infinite number of constraints (to cope with set inclu- 
sions). 

If we denote by 7r^(x) a (column vector) basis of monomials of degree up to rf, we can write 
q{x) = 7rJ{x)q where g is a vector of coefficients of given size. In vector space P^i, the set 
Q '•= {q ' 7Tj{x)q < 1} is by definition the polar of the bounded set {7Td{x) : ||x|| < R} 
where i? > is a constant chosen sufficiently large so that all vectors x G V(g) have norm 
less than R. As the polar of a compact set whose interior contains the origin, set Q is 
compact, see e.g. [18j. It follows that the feasible set of problem ([T]) is compact, and since 
the objective function is continuous, the infimum in problem ([T]) is attained. 

3 Convex conic formulation 

In this section it is assumed that g is a nonnegative homogeneous polynomial, or form, of 
even degree d = 26 in n variables. Under this restriction, in [17^ Lemma 2.4] it is proved 
that the volume function 



is convex in q. The proof of this statement relies on the striking observation [20] that 



where Cd is a constant depending only on d. Note also that boundedness of V(g) implies 
that q is nonnegative, since if there is a point Xq G such that q{xo) < 0, and hence 



of K. 



q ^ vol V(g) 
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Xq G V(g), then by homogeneity of q it foUows that q{Xxo) = A^^g(xo) < for all A and 
hence Xxq G V(g) for all A which contradicts boundedness of V(g). This implies that 
problem ([T]), once restricted to nonnegative forms, is a convex optimization problem. 

In pn Lemma 2.4] explicit expressions are given for the first and second order derivatives 
of the volume function, in terms of the moments 

/ x^e-^(^)rfx (2) 

for a G N^, |a| < 2d. In an iterative algorithm solving convex problem ([T]), one should 
then be able to compute repeatedly and quickly integrals of this kind, arguably a difficult 
task. 

When q is not homogeneous, we do not know under which conditions on q the volume 
function V(g) is convex in q. 

Motivated by these considerations, in the remainder of the paper we propose a simpler 
approach to solving problem ([T]) which is not restricted to forms, and which does not 
require the potentially intricate numerical computation of moments Our approach 
is however only a heuristic, in the sense that we do not provide guarantees of solving 
problem ([T]) globally. 



4 L^-norm minimization 

Let us write V(g) as a polynomial superlevel set 

U(p) := \{q) = {xeW : p{x) := 2 - q{x) > 1}. 

with polynomial 

p{x) = 7rJ{x)P7rs{x) (3) 

expressed as a quadratic form in a given (column vector) basis 7Ts{x) of monomials of 
degree up to S := [|], with symmetric Gram matrix P. Then, optimization problem ([T 
reads 



V 



min^^P^ volU(p) 



s.t. K C U(p). 

Note that in problem Q we can indifferently optimize over coefl&cients of p or coefficients 
of matrix P, since they are related linearly via constraint ([3]). 

Since K is compact by assumption and \J{p) is compact for problem Q to have a finite 
minimum, we suppose that we are given a compact semialgebraic set 

B := {x G M"" : bi{x) > 0, i = 1, 2, . . . , m J 

such that U(]9) C B and hence 

U{p) = {x G B : p{x) > 1}. 

The particular choice of polynomials bi will be specified later on. Now, observe that by 
definition 

p{x) > Iu^p){x) on W 
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and hence, integrating both sides we get 

/ p{x)dx > / /u(p)(x)rfx = vol U(]9), 
Jb Jb 

an inequahty known as Chebyshev's inequahty, widely used in probability, see e.g. [21 
§2.4.9]. If polynomial p is nonnegative on B then the above left-hand side is the L^-norm 
of p^ and the inequality becomes 



|i>volU(p). (5) 

Now consider the following L^-norm minimization problem: 



'= mm^^p, iipiii 

s.t. j9 > on B (6) 

p > 1 on K. 

Lemma 1 The minimum of problem ^ monotonically converges from above to the min- 
imum of problem Q)^ i.e. w^_i ^w^>v^ for all d, and lim^^^oo'^^ lim^i^oo'^^- 

Proof: The graph of polynomial p lies above /k, the indicator function of set K, while 
being nonnegative on B, so minimizing the L^-norm of p on B yields an upper approx- 
imation of /k- Monotonicity of the sequence w""^ follows immediately since polynomials 
of degree d + 1 include polynomials of degree d. When its degree increases, p converges 
in L-'^-norm to /k, hence \\p\\i converges to vol K. The convergence is pointwise almost 
everywhere, and almost uniform, but not uniform since /k is discontinuous on B. □ 

Note that this L^-norm minimization approach was originally proposed in [11] to compute 
numerically the volume and moments of a semialgebraic set. 



5 Trace minimization 

In this section we give a geometric interpretation of problem ([6]). First note that the 
objective function reads 



; Jb 

= trace J 7Ts{x)7tJ {x)dx^ = trace PM 



where 

M := tis{x)tiJ {x)dx 



B 



is the matrix of moments of the Lebesgue measure on B in basis Ti5{x). In equation ^ if 
the basis is chosen such that its entries are orthonormal w.r.t. the (scalar product induced 
by the) Lebesgue measure on B, then M is the identity matrix and inequality ^ becomes 

trace P > volU(j9) (7) 
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which indicates that, under the above constraints, minimizing the trace of the Gram 
matrix P entails minimizing the volume of \J{p). 

The choice of polynomials bi in the definition of the bounding set B should be such that 
the objective function in problem ^ is easy to compute. If 

a 

then 



and we should be able to compute easily the moments 

Va '= / [nd{x)]adx 
Jb 

of the Lebesgue measure on B w.r.t. basis 7Td{x). This is the case e.g. if 5 is a box. 



Remark 1 (Minimum trace heuristic for ellipsoids) Note that, in the case of quadratic 
polynomials, i.e. d = 2, we retrieve the classical trace heuristic used for volume mini- 
mization, see e.g. [6]. If B = [—1,1]^ then the basis 7ri{x) = ^x is orthonormal w.r.t. 
the Lebesgue measure on B and \\p\\i = |traceP. The constraint that p is nonnegative on 
B implies that the curvature of the boundary of\J{p) is nonnegative, hence that U{p) is 
convex. 



In [19], the authors restricted the search to convex polynomial sublevels \J{p) = V(g) = 
{x : q(x) < 1} by enforcing positive semidefiniteness of the Gram matrix of the quadratic 
Hessian form of q. They proposed to maximize (the logarithm of) the determinant of 
the Gram matrix, justifying this choice by explaining that this increases the curvature 
of the polynomial sublevel set along all directions, and hence minimizes the volume. 
Supported by Lemma [l] and the above discussion, we came to the consistent conclusion 
that minimizing the trace of the Gram matrix of p, that is, maximizing the trace of the 
Gram matrix of g, is a relevant heuristic for volume minimization. Note however that in 
our approach we do not enforce convexity of U{p). 



6 Handling constraints 



If K = {x : gi{x) > 0, z = 1,2, ...,m} is a general semialgebraic set then we must 
ensure that polynomial p — 1 is nonegative on K, and for this we can use Putinar's 
Positivstellensatz and a hierarchy of finite-dimensional convex LMI relaxations which are 
linear in the coeflBcients of p. More specifically, we write p — 1 = r^ + J2i ^i9i where 
To, ri, . . . , are polynomial sum-of-squares of given degree, to be found. For each fixed 
degree, the problem of finding such polynomials is an LMI, see e.g. [17^ Section 3.2]. The 
constraint that p is nonnegative on B can be ensured similarly. 

A particularly interesting case is when K is a discrete set, i.e. a union of points G M^, 
i = 1, . . . , A^. Indeed, in this case the inclusion constraint K C U(]9) is equivalent to a 
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finite number of inequalities p{xi) >1, i = l,...,A^ which are linear in the coefficients 
of p. Similarly, the constraint that p is nonnegative on B can be handled by linear 
inequalities p{xj) > enforced at a dense grid of points G B, j = 1, . . . , M, for M 
sufficiently large. Note that, with this pure linear programming (LP) approach, it is not 
guaranteed that p is nonnegative on B, but what matters primarily is that K C U(]9), 
which is indeed guaranteed. This purely LP formulation allows to deal with problems 
with rather large A^. 



7 Examples 

We illustrate the proposed approach for the case when K is a discrete set, a union of points 
of M^. For our numerical examples, we have used the YALMIP interface for Matlab to 
model the LMI optimization problem ([T]), and the SDP solver SeDuMi to solve numerically 
the problem. Since the degrees of the semialgebraic sets we compute are typically low (say 
less than 20), we did not attempt to use appropriate polynomial bases (e.g. Chebyshev 
polynomials) to improve the quality and resolution of the optimization problems, see 
however [11] for a discussion on these numerical matters. 



7.1 Line (n = 1) 

To illustrate the behavior of the proposed optimization procedure, we first consider B = 
[— 1, 1] and K = { — 1,0, |}. On Figure l] we represent the solutions p of degrees 2, 7, 
17 and 26 of minimization problem ([g]). We observe that the superlevel set U{p) = {x G 
B : p{x) > 1} is simply connected for degree 2, doubly connected for degree 7, and triply 
connected for degrees 17 and 26. Note that, as the degree increases of p, the length of the 
intervals for which p{x) > 1 tends to zero. This is consistent with the fact that the volume 
of a finite set is zero. We also observe that polynomial p shows increasing oscillations for 
increasing degrees, a typical feature of such discontinuous function approximations. 



7.2 Plane (n = 2) 

In B = [—1, 1]^ we consider two clouds of 50 points each, i.e. = 100. On Figure [2] we 
represent the solutions p of degrees 2, 5, 9 and 14 of minimization problem ([g]). Here too 
we observe that increasing the degree of p allows to disconnect set U(p). The side effects 
near the border of B on the top right figure can be removed by enlarging the bounding 
set B. 



7.3 Space (n = 3) 

In B = [—1, 1]^ we consider = 10 points. The solutions p of degrees 4, 10, and 14 of 
minimization problem ([g]) is depicted in Figure [3| Here too we observe that increasing 
the degree of p allows to capture point clusters in distinct connected components. 

It should be pointed out that all the semialgebraic sets in the previous examples were 
computed in a few seconds of CPU time on a standard PC. 
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Figure 1: Minimum L^-norm polynomials p (red) such that p > on [— 1, 1] and p > 1 
at the 3 points (black), for degree 2 (upper left), 7 (upper right), 17 (lower left) and 26 
(lower right). 

8 Conclusion 

In this paper, we proposed a simple technique for approximating a given set of compli- 
cated shape by means of the polynomial sub level set of minimal size that contains it. 
The proposed approximation has been shown to be convex in the case the polynomial 
is assumed to be homogeneous. Then, a tractable relaxation based on Chebychev's in- 
equality has been introduced. Interestingly, this relaxation reduces to the classical trace 
minimization heuristic in the case of quadratic polynomials, thus indirectly providing an 
intuitive explanation to this widely used criterion. 

Ongoing research is devoted to utilize the proposed approximation to construct new classes 
of set-theoretic filters, in the spirit of the works [U [9]. 
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Figure 2: Minimum L^-norm polynomials p (red) and low surface semialgebraic sets 
(yellow) such that p > on [— 1, 1]^ and p > 1 at 100 points (black), for degree 2 (upper 
left), 5 (upper right), 9 (lower left) and 14 (lower right). 




Figure 3: Including the same 10 space points (black) in low volume semialgebraic sets 
(yellow) of degree 4 (left), 10 (center), 14 (right). 
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